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CROSS-REFERENCE TO RELATED APPLICATIONS 
[0001] Not Applicable 

STATEMENT OF FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT 
[0002] Not Applicable 
BACKGROUND OF THE INVENTION 

1. TECHNICAL FIELD 

[0003] This invention relates in general to electronic communications and, 
more particularly, to error correction using a Viterbi decoder. 

2. DESCRIPTION OF THE RELATED ART 

[0004] Many electronic devices use error correction techniques in conjunction 
with data transfers between components and/ or data storage. Error correction is 
used in many situations, but is particularly important for wireless data 
communications, where data can easily be corrupted between the transmitter 
and the receiver. In some cases, errant data is identified as such and 
retransmission is requested. Using more robust error correction schemes, 
however, errant data can be reconstructed without retransmission. 
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[0005] One popular error correction technique uses Viterbi decoding to detect 
errors in a data stream from a convolution encoder. A Viterbi decoder 
determines costs associated with multiple possible paths between nodes. After a 
specified number of stages, the node with the minimum associated cost is 
5 chosen, and a path is traced back through the previous stages. The data is 
decoded data based on the selected path. 

[0006] Actual implementations of Viterbi decoding use dedicated hardware, 
because the decoding is computationally intensive. More and more devices, 
however, are turning to DSPs (digital signal processors) to handle the 
10 computational chores. Additional circuitry dedicated to Viterbi decoding on a 
DSP is undesirable, because it adds to the cost of the DSP and the power 
consumed by the DSP. 

[0007] Accordingly, a need has arisen for a method and apparatus for 
performing Viterbi decoding in software. 
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BRIEF SUMMARY OF THE INVENTION 

[0008] The present invention performs a Viterbi decoding function by 
calculating candidate path metrics for states at time T n based on previously 
calculated path metrics for states at time T n -i and branch metrics associated with 
transitions between the states at time T n -i and states at time T n according to a first 
trellis, selecting path metrics for states at time T n from the candidate path metrics 
and calculating candidate path metrics for states at T n +i based on the selected 
path metrics for states at T n according to a second trellis, different from the first 
trellis. 



[0009] Using an asymmetrical trellis structure can provide efficiencies that 
allow a programmable processing device, such as a digital signal processor, to 
provide Viterbi decoding at high speeds. 
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BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

[0010] For a more complete understanding of the present invention, and the 
advantages thereof, reference is now made to the following descriptions taken in 
conjunction with the accompanying drawings, in which: 

[0011] Figure 1 is a example of a data communication connection used in the 
prior art; 

[0012] Figure 2 is a block diagram of a conventional data encoder; 

[0013] Figure 3 is a state diagram of the encoder of Figure 2; 

[0014] Figure 4 is a trellis diagram showing data transitions; 

[0015] Figure 5 is a trellis diagram showing the decoding of the data from the 
encoder of Figure 2; 

[0016] Figures 6a through 6d are trellis diagrams showing the calculation of 
path metrics through the trellis diagram; 

[0017] Figure 7 is a block diagram of a programmable processing device 
capable of multi-field arithmetic and logic operations; 

[0018] Figure 8 is a prior art trellis diagram for a 16-state Viterbi decoder; 

[0019] Figure 9 is an asymmetrical trellis pair used in conjunction with the 
processing device of Figure 7; 

[0020] Figures 10a through lOd are partial trellis diagrams showing Viterbi 
decoding for respective destination state groups; 

[0021] Figure 11 illustrates various registers used in the implementation of 
the Viterbi decoder; 
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[0022] Figure 12 illustrates a flow chart describing implementation of the first 
trellis of the trellis pair; 

[0023] Figure 13 illustrates a flow chart describing implementation of the 
second trellis of the trellis pair; and 

[0024] Figure 14a through Figure 14d illustrate partial trellis diagrams for a 
first destination state group the second trellis over respective passes. 
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DETAILED DESCRIPTION OF THE INVENTION 

[0025] The present invention is best understood in relation to Figures 1 - 14 of 
the drawings, like numerals being used for like elements of the various 
drawings. 

[0026] Figure 1 illustrates a general block diagram of communications 
between a data source and destination using convolutional encoding. At the 
source, fc-bit data is received by a convolutional encoder 12. The convolutional 
encoder 12 generates an n-bit encoded data output based on the received data. 
The encoded data is transmitted to the destination through a transmission 
medium 14. During transmission, noise may be added to the encoded data, 
thereby corrupting some of the output. At the destination, the possibly 
corrupted data is received by Viterbi decoder 16. The Viterbi decoder recovers 
the original data; even if the encoded data is corrupted, the Viterbi decoder is 
able to recover the original data in many situations. 

[0027] For illustration of convolutional encoding, an example using a fc=l, n=2 
structure is shown in Figure 2. The encoder 12 receives the data to be encoded 
into a flip-flop 18 and two modulo-2 adders 20 and 22. The output of flip-flop 18 
is also received by an input of modulo-2 adder 20. The output of flip-flop 18 is 
also coupled to the input of flip-flop 24. The output of flip-flop 24 is coupled to 
an input of modulo-2 adder 20 and an input of modulo-2 adder 22. The encoded 
output XY of the convolution encoder 12 is the output of modulo-2 adder 20 (X) 
and modulo-2 adder 22 (Y). 

[0028] The convolutional encoder 12 has a constraint length (K) of 3, meaning 
that the current output is dependent upon the last three inputs. The dependency 
on previous values to affect the encoded data output allows the Viterbi decoder 
to reconstruct the data despite transmission errors. Convolutional decoders are 
often classified as (n,fc,K) encoders; hence the encoder shown in Figure 2 would 
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be a (2,1,3) encoder. The connection vectors, which define the connections 
between the shift register formed by flip-flops 18 and 24, for the encoder shown 
in Figure 2 are "111" for modulo-2 adder 20 and "101" for modulo-2 adder 22. 

[0029] The "state" of the encoder 12 is defined as the outputs of the flip-flops 
5 18 and 24. Thus the state of encoder 12 can be notated as "(output of TIT 18, output 

ofFF 24)". A state diagram for the encoder of Figure 2 is shown in Figure 3. 

Each of the four possible states (00, 01,10 and 11) is shown within a circle. 

Transitions between states are shown responsive to a data input of "0" (solid 

line) or a data input of "1" (dashed line). The two-bit value above the transition 
D 10 line is the resulting output XY. Thus, from a state of "00", an input of "0" will 
fij result in a return to "00" with an output of "00". An input of 1 will result in a 

jjj transition to "10" and an output of "11". 

aw 

?: 

B [0030] The state diagram of Figure 3 shows the transitions from any state at 

s 

U any given moment. In Figure 4, a "trellis" diagram is used to shown the 

i y 

fij 15 transitions over time. From an arbitrary time, T 2/ the trellis diagram of Figure 4 
pi shows the possible state transitions and outputs responsive to a given data input. 

[0031] Figure 5 shows an example of a path through the trellis using a data 
input sequence of "1011" from an initial state of "00". The initial data input "1" 
causes a transition from state "00 // to state "10" and an encoded output of "11". 
20 The next data input, "0", causes a transition from state "10" to state "01" and an 
encoded output of "10". The following data input, "1", causes a transition from 
state "01" to "10" and an encoded output of "00". The final data input, "1", 
causes a transition from state "10" to state "11 " and an encoded output of "01". 

[0032] The encoded output "11 10 00 01 " will be transmitted to a receiving 
25 device with a Viterbi decoder. The two-bit encoded outputs are used to 

reconstruct the data. By convention, a data transmission begins in state "00". 
Hence, the first encoded output "11" would signify that the first input data bit 
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was a "1" and the next state was "10". Assuming no errors in transmission, the 
data input could be determined by state diagram of Figure 2 or the trellis of 
Figure 3. 

[0033] However, in real-world conditions, the encoded data may be 
5 corrupted during transmission. In essence, the Viterbi decoder 16 traces all 
possible paths, maintaining a "path metric" for each path, which accumulates 
differences ("branch metrics") between the each of the encoded outputs actually 
received and the encoded outputs that would be expected for that path. The 
path with the lowest path metric is the maximum likelihood path. 

O 

p 10 [0034] Figure 6a illustrates computation of the branch metrics for the 

ry 

ffk transition from the initial state of "00". In this case, an "11" was received. With 

^ two-bit outputs, a "Hamming distance" may be used to calculate the branch 

O metric. The Hamming distance is the sum of exclusive-or operation on 

jU respective bits of the received output and the expected output. For the path 

fjj 15 assuming a "0" input, the branch metric between the received encoded output 
P ("ll") and the expected encoded output ("00") is two. For the path assuming a 

H "1" input, the branch metric between the received encoded output ("ll") and the 

expected encoded output ("00") is zero. Hence the path metric at state "00" at 
time Ti is two and the path metric at state "10" at time Ti is zero. The path 
20 metrics are shown above the states in the diagram. 

[0035] Figure 6b illustrates the path through time T2. In this example, it is 
assumed that there is a data transmission error, and the received encoded output 
is "11" rather than "10". Hence, at T 2/ the path metric is four for state "00", one 
for state "01", two for state "10" and one for state "11". 

25 [0036] Figure 6c illustrates the path through time T^ At this point, two 
potential paths are entering each state. For each state, the branch metric is 
computed for each path entering the state, and the path with the lowest path 
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metric is chosen (the " surviving path"). If two paths have the same path metric 
(such as state "01" at T3), a path can be chosen randomly or deterministically 
(such as by always choosing the upper path). 

[0037] Figure 6d shows the path through time T4. At this point, the actual 
5 path through states "10 01 10 11" has the lowest path metric. If the example 
sequence were longer, the path metrics for all other paths would increase as the 
path metric for the actual path remained the same (assuming no additional 
errors). When the end of a path is reached, the most likely path is determined 
through a process called "traceback". 

10 [0038] As can be seen in Figures 6a-d, for each time period, a branch metric 
calculation and path metric calculation must be performed for each path entering 

u t 

lif a state. Further, a comparison must be performed to determine the surviving 

par. 

m state. For the example shown in Figures 2-6, this is not terribly computation 

j\ intensive. But for larger trellis structures, for example a radix-4 trellis, the 

J 15 computations involved may necessitate a dedicated hardware decoder, rather 

iy 

M= than a software Viterbi decoder. 



[0039] The present invention is described in conjunction with a 16-state 
Viterbi decoder. The method of performing the decoding the encoded 
information using software uses a programmable processing device, such as the 
20 C60 series of digital signal processors from TEXAS INSTRUMENTS 

INCORPORATED. A simplified block diagram showing the pertinent features of 
such a processing device is shown in Figure 7. 

[0040] In the preferred embodiment, the processing device 40 includes one or 
more arithmetic units 42 capable of multiple field arithmetic and a plurality of 
25 registers 44, typically arranged in a register file 46. The arithmetic units 42 have 
the ability to perform separate logical and arithmetic operations on predefined 
fields 48 within their input registers 50 under program control (represented by 
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control logic 51). For example, if an arithmetic unit 42 uses 32-bit input registers, 
it could perform four simultaneous compares between four respective eight-bit 
fields 48 within the input registers 50. The method described herein takes 
advantage of the simultaneous operations in order to efficiently process 
5 information such that a software Viterbi decoding operation can be performed at 
a suitable speed. 

[0041] Figure 8 illustrates a prior art sixteen state Viterbi decoding stage 60. 
The sixteen states are notated in hexadecimal format as states 0-F. As shown in 
Figure 5, this same stage is used between consecutive time periods (T n , T n +i) 
Q 10 throughout the decoder. However, the computation involved in using this type 
S of decoder stage is too complex to accommodate a typical data rate using a 

j*f | software programmable device. 

IM 

00 [0042] Figure 9 illustrates an asymmetrical Viterbi decoding stage pair 70 that 

U increases the efficiency of computation when used with a processing device of 
**i 15 the type shown in Figure 7. The decoding stage pair is "asymmetrical" because 

g consecutive stages use different operations to perform the path metric 

H calculations. For reference, the pair includes "A-trellis" 72 and "B-trellis" 74. 

[0043] Figures lOa-d illustrate partial trellis diagrams showing how path 
metrics are concurrently calculated at four destination states. For example, 
20 referring to Figure 10a, at T n +i, new path metrics for states 0, 4, 8 and C are 

concurrently calculated. Similarly, at T n +2, new path metrics for states 0, 1, 2, and 
3 are concurrently calculated from the results of staes 0, 4, 8 and C at T n +i. The 
lines indicate which fields are used for the calculations of the new path metrics; 
the particular fields used could be different depending upon the implementation. 

25 [0044] Recapping the discussion of four-state Viterbi decoders, it can be seen 
from Figure 10a that the path metric for state 0 at T n +i is equal to the lowest of the 
sum of the path metrics at states 0, 4, 8 and C at T n added to the respective 
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branch metric between those states and state 0 at T n +i. To accurately describe the 
relationships, P(s,f) equals the path metric of state s at time t and B(si,S2) is the 
branch metric between state si and S2 based on the received encoded data. 
Hence: 

P(0,T„ + i) equals mintP^+B^O), P(4,T n )+B(4,0), P(8,T n )+B(8,0), P(C / T n )+B(C / 0)] 
P(4,T n+ i) equals min[P(l,Tn)+B(l,4), P(5,T„)+B(5,4), P(9,T„)+B(9,4), P(D / T„)+B(D / 4)] 
P(8,T n+ i) equals min[P(2,T n )+B(2,8), P(6 / T n )+B(6 / 8) / P(A,T n )+B(A / 8) / P(E,T n )+B(E,8)] 
P(QT n+ i) equals min[P(3,T n ) +B(3,C), P(7,T n )+B(7,C), P(B / T„)+B(C / Q / P(F,T n )+B(F,C)] 

[0045] The operations for Figure lOa-d will be discussed with reference to 
Figures 11, 12 and 13. Figure 11 shows an example of a register file allocation. 
Four registers are used as temporary registers, TMP(CL3). These registers are 
used for intermediate calculations. Eight registers are used for storing path 
metrics. Each of these registers holds path metric values for four states in 
respective fields. For example, the register storing CPM(048C) stores path metric 
at states 0, 4, 8 and C Of these eight, four registers are used to store the path 
metric calculated by the A-trellis 72 and four registers are used to store the path 
metrics calculated by the B-trellis 74. The registers used to store the path metrics 
for the A-trellis 72 are used as the most recently calculated path metrics for the 
calculations performed by the B-trellis 74. Likewise, the registers used to store 
the path metrics for the B-trellis 74 are used as the most recently calculated path 
metrics for the calculations performed by the A-trellis 72. 

[0046] Additionally, a number of registers storing branch metrics for the path 
metric calculations are provided. Because the branch metrics will depend upon 
the encoding scheme, their calculation will not be specified. No matter what 
method is used for calculating branch metrics, it should be possible to pre- 
calculate the branch metrics (as the data is received) for efficient calculation of 
the candidate path metrics. 
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[0047] Figure 12 illustrates a flow chart describing the calculations using the 
A-trellis 72. It should be noted that this flowchart is meant to describe the 
various calculations being made to implement the A-trellis and is not a detailed 
description of a particular order. Multiple arithmetic units, some steps can be 
5 performed concurrently for the greatest time efficiency. For clarity, the line 
between states (representing the path metric) in the Figures illustrating the 
trellises is depicted according to the field in the four-field word which stores the 
intermediate result according to the following legend: 

[0048] In block 80, variables q, r, and j are set to zero. For purposes of 
10 reference, the current time is T n +i. These variables are used as indices for various 
registers and states. In block 82, one input (IR1) is loaded with a path metric 
hi from the register file 46. On the first pass, therefore, CMP(0123) (see Figure 11) is 

K loaded into IR1. This register holds the path metrics computed for states 0, 4, 8 

* andCatTn. 

|bss 

fy 

fy 15 [0049] In block 84, the other input is loaded with branch metrics based on the 
p current data value, the source state and the destination state. There are four 

branch metrics in four fields. For the first pass, the branch metrics B(0,0), B(l,4), 
13(2,8) and B(3,C) are loaded. It is assumed that the branch metrics have been 
previously computed and packed into four-field words and stored in the register 
20 file 46. 

[0050] A multi-field addition is performed in step 86, where the first field 
from IR1 is added to the first field in IR2, the second field in IR1 is added to the 
second field in IR2, and so on. The result is stored in a temporary file TMP(j). 
Hence on the first pass, the result will be stored in TMP(0). Accordingly, at the 
25 end of the first pass (j=0), TMP(0) will store the candidate path metrics associated 
with a transition from state 0 (at T n ) to state 0 (at T n -ri), state 1 (at T n ) to state 4 (at 
T n +i), state 2 (at T n ) to state 8 (at T n +i), and state 3 (at T n ) to state C (at T n +i). 
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[0051] On each pass (i.e., for each increment of; in blocks 88 and 90), a new 
set of source states and branch metrics are used to calculate additional candidate 
path metrics. Thus, on the second pass (/=1), TMP(l) stores the candidate path 
metrics associated with a transition from state 4 (at T n ) to state 0 (at T n +i), state 5 
(at T n ) to state 4 (at T n +i), state 6 (at T n ) to state 8 (at T n +i), and state 7 (at T n ) to 
state C (at T n +i). On subsequent passes, TMP(2) stores the candidate path metrics 
associated with a transition from state 8 (at T n ) to state 0 (at T n +i), state 9 (at T n ) to 
state 4 (at T n +i), state A (at T n ) to state 8 (at T n +i), and state B (at T n ) to state C (at 
Tn+i). TMP(3) stores the candidate path metrics associated with a transition from 
state C (at T n ) to state 0 (at T n+ i), state D (at T n ) to state 4 (at T n +i), state E (at T n ) to 
state 8 (at T n +i), and state F (at T n ) to state C (at T n +i). 



Table 1 Content of TMP Registers After First Group 





State 0 


State 4 


State 8 


State C 


TMP(O) 


P(0,T n )+B(0,0) 


P(1,T„)+B(1,4) 


P(2,T„)+B(2,8) 


P(3,T„)+B(3,C) 


TMP(l) 


P(4,T n )+B(4,0) 


P(5,T n )+B(5,4) 


P(6,T„)+B(6,8) 


P(7,T„)+B(7,C) 


TMP(2) 


P(8,T„)+B(8,0) 


P(9/T n )+B(9,4) 


P(A,T n )+B(A / 8) 


P(B / T„)+B(C,C) 


TMP(3) 


P(C,T n )+B(C,0) 


P(D / T n )+B(D / 4) 


P(E,T„)+B(E / 8) 


P(F,T„)+B(F,C) 



[0052] For the first group of destination states (0,4,8,C), when;=3 in block 88, 
the four temporary registers TMP(0..3) hold, in respective fields, the candidate 
path metrics of the four possible transitions to states 0, 4, 8 and C, as shown in 
Table 1. In block 92, the respective fields of TMP(O) and TMP(l) are compared 
and the paths with the lowest path metric for each field are selected and stored 
back in TMP(0). In block 94, the respective fields of TMP(2) and TMP(3) are 
compared and the paths with the lowest path metric for each field are selected 
and stored in TMP(l). Finally, in block 96, TMP(0) and TMP(l) are compared 
and the lowest path metric for each field is stored in the appropriate register 
associate with the states being evaluated. For the A-Trellis of Figure 10a, this 
would be CMP(048C). 
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[0053] In blocks 98 and 100, the same flow as described above is used to 
determine the lowest cost path for states associated with CMP(159D) / 
CMP(26AE), and CMP(37BF). The contents of the TMP registers prior to the 
comparisons are illustrated below in Tables 2-4. 



Table 2 Content of TMP Registers After Second Group 





State 1 


State 5 


State 9 


State D 


TMP(O) 


P(0,T„)+B(0,1) 


pa^o+Ba^) 


P(2,T n )+B(2,9) 


P(3,T n )+B(3,D) 


TMP(l) 


P(4,T n )+B(4,l) 


P(5,T n )+B(5,5) 


P(6 / T n )+B(6 / 9) 


P(7,T n )+B(7,D) 


TMP(2) 


P(8,T„)+B(8,1) 


P(9,T n )+B(9,5) 


P(A,T n )+B(A,9) 


P(B,T n )+B(C,D) 


TMP(3) 


P(C,T„)+B(C,1) 


P(D,r n )+B(D,5) 


P(E,T n )+B(E,9) 


P(F,T„)+B(F,D) 



Table 3 Content of TMP Registers After Third Group 





State 2 


State 6 


State A 


State E 


TMP(O) 


P(0,T„)+B(0,2) 


P(l / T n )+B(l,6) 


P(2,T n )+B(2,A) 


P(3,T n )+B(3 / E) 


TMP(l) 


P(4,T n )+B(4,2) 


P(5,T n )+B(5,6) 


P(6,T n )+B(6,A) 


P(7 / T n )+B(7,E) 


TMP(2) 


P(8,T n )+B(8,2) 


P(9,T n )+B(9,6) 


P^A^+BC^A) 


P(B,Tn)+B(C,E) 


TMP(3) 


P(C,T n )+B(C,2) 


P(D / T„)+B(D / 6) 


P(E,T n )+B(E,A) 


P(F,T n )+B(F / E) 



Table 4 Content of TMP Registers After Fourth Group 





State 3 


State 7 


State B 


State F 


TMP(O) 


P(0,T n )+B(0,3) 


P(1,T„)+B(1,7) 


P(2 / T n )+B(2,B) 


P(3 / T n )+B(3,F) 


TMP(l) 


P(4 / T n )+B(4 / 3) 


P(5 / T r .)+B(5 / 7) 


P(6,T n )+B(6 / B) 


P(7,T n )+B(7,F) 


TMP(2) 


P(8,T n )+B(8 / 3) 


P(9,T n )+B(9,7) 


P(A,T n )+B(A,B) 


P(B,T r )+B(C,F) 


TMP(3) 


P(C / T n )+B(C,3) 


P(D / T n )+B(D,7) 


P(E,T n )+B(E,B) 


P(F,T n )+B(F,F) 



[0054] The operation of the processing device 40 to implement the B-Trellis 74 



is somewhat different. As shown m Figure 10a, the first group of destination 
registers for this trellis includes states 0, 1, 2 and 3. The computation of the 
candidate path metrics for these groups is based on a single set of four source 
states: states 0, 4, 8, and C (the destination states for the preceding A-Trellis 72). 
The preferred implementation of this trellis rotates the contents of the registers 
computed in the previous A-Trellis 72 to derive the four candidate path metrics 
for each destination state. 
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[0055] A flow chart describing the implementation of the B-Trellis 74 is given 
in Figure 13. In Figures 14a-d, the four passes for the first group (destination 
states 0, 1, 2, 3) are separated for reference. The final path metrics for each 
destination state are: 

P(0,T„ + i) equals min[P(0,T n )+B(0,0), P(4,T n )+B(4,0), P(8,T„)+B(8,0), P(C,T n )+B(C,0)] 
P(l,T„ + i) equals min[P(4 / T n )+B(4 / l) / P(8,T n )+B(8,l), P(C,T„)+B(C,1), P(0,T n )+B(0,l)] 
P(2,T n+ i) equals min[P(8,T n )+B(8,2), P(C / T„)+B(C / 2) / P(0,T„)+B(0,2), P(4,T n )+B(4,2)] 
P(3,T n+ i) equals min[P(C,T n )+B(C,3), P(0,T n )+B(0,3), P(4,T n )+B(4,3), P(8,T n )+B(8,3)] 

[0056] In block 110, the indices are initialized. In block 112, IR1 is loaded with 
a register 44 storing a set of previously calculated path metrics. For the first 
group of destination states (shown in Figures 14a-d), CMP(048C) is loaded into 
IR1. For the second, third and fourth groups of destination states, CMP(159D), 
CMP(26AE) and CMP(37BF), respectively, will be loaded. 

[0057] In block 114, the appropriate branch metrics are loaded into respective 
fields of input register IR2. On the first pass (C=0), the four fields are set to 
B(r,q) | B(r+4,q+l) | B(r+8,q+2) | B(r+C,q+3). On the second pass, the four fields 
are set to B(r+4,q) | B(r+8,q+l) | B(r+C,q+2) | B(r,q+3). On the third pass, the four 
fields are set to B(r+8,q) | B(r+C,q+1) | B(r,q+2) | B(r+4,q+3). On the fourth pass, 
the four fields are set to B(r+C,q) | B(r,q+1) | B(r+4,q+2) i B(r+8,q+3). 

[0058] In block 116, the multi-field addition is performed, rendering one 
candidate path metric for each of the tour destination states, which is stored in a 
TMP register. In block 118 and 120, fields in IR1 are rotated on each pass. The 
reason for rotating the fields is that each of the input states are used in a 
candidate path metric calculation over four passes. Thus, for example, source 
state 0 (T n ) is used for a path metric calculation for destination state 0 in the first 
pass, for destination state 1 in the second pass, destination state 2 in the third 
pass and destination state 3 in the fourth pass. The rotation aligns the source 
state with the proper field for the multi-field add operation (see Tables 5-8, 
below). 
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Table 5 Content of TMP Registers After First Group 







State 0 


State 1 


State 2 


State 3 




TMP(O) 


P(0 / Tn)+B(0,0) 


P(4,T n )+B(4 / l) 


P(8,T n )+B(8,2) 


P(C,Tn)+B(C / 3) 




TMP(l) 


P(4,Tn)+B(4,0) 


P(8,T n )+B(8,l) 


P(C,T n )+B(C2) 


P(0,T n )+B(0,3) 




TMP(2) 


P(8,Tn)+B(8,0) 


P(C,T n )+B(C,l) 


P(0,T„)+B(0,2) 


P(4/r n )+B(4,3) 




TMP(3) 


P(C,Tn)+B(C,0) 


P(0,T n )+B(0,l) 




P(8 / T n )+B(8 / 3) 






Table 6 Content of TMP Registers After Second Group 






State 4 


State 5 


State 6 


State 7 




TMP(O) 


P(l,T n )+B(l,4) 


P(5 / T n )+B(5 / 5) 


P(9,T n )+B(9,6) 


P(D,T n )+B(D ; 7) 




TMP(l) 


P(5,T n )+B(5,4) 


P(9/r n )+B(9,5) 


P(D,T n )+B(D,6) 


PaXJ+BCL^) 




TMP(2) 


P(9,Tn)+B(9 / 4) 


P(D / lnHB(D,5) 


P(1,T„)+B(1,6) 


P(5/Tn)+B(5,7) 




TMP(3) 


P(D,T n )+B(D,4) 


P(l,T n )+B(l,5) 


P(5,Tn)+B(5,6) 


P(9,T n )+B(9,7) 


o 

fit 




Table 7 Content of TMP Registers After Third Group 


m 




State 8 


State 9 


State A 


State B 


lii 


TMP(O) 


P(2 / T n )+B(2,8) 


P(6,T n )+B(6,8) 


P(A,Tn)+B(A,A) 


P(E,T n )+B(E,B) 


Liu 


TMP(l) 


P(6 / T n )+B(6,8) 


P(A,T n )+B(A,8) 


P(E,T n )+B(E,A) 


P(2,T n )+B(2,B) 


s 


TMP(2) 


P(A / T„)+B(A / 8) 


P(E / T n )+B(E,8) 


P(2,T n )+B(2,A) 


P(6,T n )+B(6,B) 


fx* 


TMP(3) 


P(E,T n )+B(E,8) 


P(2,T n )+B(2,8) 


P(6,T n )+B(6,A) 


P(A,T n )+B(A,B) 


ps 

t 




Table 8 Content of TMP Registers After Fourth Group 






State C 


State D 


State E 


State F 




TMP(O) 


P(3,T n )+B(3,C) 


P(7,T n )+B(7,D) 


P(B,T n )+B(B,E) 


P(F,Tn)+B(F,F) 




TMP(l) 


P(7,T„)+B(7,Q 


P(B,T n )+B(B,D) 


P(F,T I1 )+B(F,E) 


P(3,T n )+B(3,F) 




TMP(2) 


P(B,T n )+B(B,C) 


P(F,T n )+B(F,D) 


P(3,T n )+B(3,E) 


P(7 / T n )+B(7 / F) 




TMP(3) 


P(F,T n )+B(F,C) 


P^/f^B^D) 


P(7,T n )+B(7,E) 


P(B,T n )+B(B,F) 



5 [0059] After the four passes are complete, the compare operations of blocks 



122, 124 and 126 are implemented, similar to those shown in blocks 92-96 of 
Figure 12. In block 122, the respective fields of TMP(O) and TMP(l) are 
compared and the paths with the lowest path metric for each field are selected 
and stored back in TMP(O). In block 124, the respective fields of TMP(2) and 
10 TMP(3) are compared and the paths with the lowest path metric for each field are 
selected and stored in TMP(l). Finally, in block 126, TMP(O) and TMP(l) are 
compared and the lowest path metric for each field is stored in the appropriate 
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register associate with the states being evaluated. For the A-Trellis of Figure 10a, 
this would be CMP(0123). 

[0060] After the selection of the lowest path metric for each of the destination 
states in the group, the next group is selected by blocks 128 and 130, until all the 
5 path metrics are complete. The path metrics computed by the B-trellis are used 
by the next A-trellis for computation of the next set of path metrics. 

[0061] The present invention provides significant advantages over the prior 
art. By rearranging the trellises, multi-field additions and comparisons can be 
used to greatly speed the computations of the Viterbi decoder, thereby allowing 
O 10 a software decoder to be implemented. 

fp [0062] Although the Detailed Description of the invention has been directed 



to certain exemplary embodiments, various modifications of these embodiments, 
as well as alternative embodiments, will be suggested to those skilled in the art 
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The invention encompasses any modifications or alternative embodiments that 
fall within the scope of the Claims. 
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